Table: 
    data_dev_scores

Description:
    This table contains information about reviews and the scientists who wrote them.
    Each row represents one review/reviewer

Columns: 
    RowID: [integer] Anonymous unique review identifier
    JournalID: [integer] Anonymous unique Journal identifier
    PRType: [string/categorical] Peer review type {unpublished, open} 
    IsSigned: [boolean] Indicates if the reviewer has signed the review {0,1}
    IFQuartile: [string/categorical] JCR quartile of the journal 2020 {Q1, Q2, Q3, Q4, NI/not indexed}
    Gender: [string/categorical] Gender guessed as described in the paper {male, female}
    Country: [string/categorical] Scientist country as defined in the United Nations Statistics Division M49 standard
    ContinentalRegion: [string/categorical] Scientist affiliation continental region as defined in the United Nations Statistics Division M49 standard
    StaticalRegion: [string/categorical] Scientist affiliation statical region as defined in the United Nations Statistics Division M49 standard
    ExampleCount: [integer] Total number of sentences labeled as examples
    CriticismCount: [integer] Total number of sentences labeled as criticism
    SuggestionAndSolutionCount: [integer] Total number of sentences labeled as suggestion and solution
    ImportanceAndRelevanceCount: [integer] Total number of sentences labeled as importance and relevance
    MaterialsAndMethodsCount: [integer] Total number of sentences labeled as materials and methods
    PraiseCount: [integer] Total number of sentences labeled as praise
    PresentationAndReportingCount: [integer] Total number of sentences labeled as presentation and reporting
    ResultsAndDiscussionCount: [integer] Total number of sentences labeled as results and discussion
    MultipleCategoriesCount: [integer] Total number of sentences assigned to more than one category
    NoCategoryCount: [integer] Total number of sentences assigned to no category
    SingleCategoryCount: [integer] Total number of sentences assigned to exactly one category
    Example: [float] Example dimension from the Developmental Score
    Criticism: [float] Criticism dimension from the Developmental Score
    SuggestionAndSolution: [float] Suggestion and solution dimension from the Developmental Score 
    ImportanceAndRelevance: [float] Importance and relevance dimension from the Developmental Score
    MaterialsAndMethods: [float] Materials and methods dimension from the Developmental Score
    Praise: [float] Praise dimension from the Developmental Score
    PresentationAndReporting: [float] Presentation and reporting dimension from the Developmental Score
    ResultsAndDiscussion: [float] Results and discussion dimension from the Developmental Score
    Score: [float] Developmental Score as in the paper 
    EuNaAu: [boolean] Indicates whether the scientist's affiliation is in a Western Region.
	TRUE: The ContinentalRegion is one of Europe, North America, or Oceania.
	FALSE: The ContinentalRegion is outside of these regions.
    SentenceCount: [float] Number of sentences in the review.
    WordCount: [float] Number of words in the review.


Table: 
    data_gini_by_manuscript
    
Description:
    This table contains the Gini Index for the developmental score of all reviews per manuscript.
    Each row represents one manuscript

Columns:
    RowID: [integer] Anonymous unique manuscript identifier
    IFQuartile: [string/categorical] JCR quartile of the journal 2020 {Q1, Q2, Q3, Q4, NI/not indexed}
    PRType: [string/categorical] Peer review type {unpublished, open} 
    GiniIndex: [float] Gini Index calculated for all reviews of a manuscript
    GenderAgg: [string/categorical] Aggregated classification of reviewers' gender for the manuscript:
	All women: All reviewers identified as women.
	All men: All reviewers identified as men.
	Mixed: Manuscript reviewed by a mix of men and women.
    EuNaAuAgg: [string/categorical] Aggregated classification of reviewers' geographical affiliation for the manuscript:
	All EuNaAu: All reviewers are from Western Regions (Europe, North America, or Oceania).
	All rest: All reviewers are from other regions.
	Mixed: Manuscript reviewed by a mix of reviewers from Western and non-Western regions



Table:
    data_gini_by_scientist
    
Description:
    This table contains the Gini Index for the developmental score for all reviews done by each scientist.
    Each row represents one scientist
    
Columns:
    RowID: [integer] Anonymous unique scientist identifier
    Gender: [string/categorical] Gender guessed as described in the paper {male, female}
    ContinentalRegion: [string/categorical] Scientist affiliation continental region as defined in the United Nations Statistics Division M49 standard
    Seniority: [string/categorical] Number of years since the scientist's first publication {<5 years, 5 to 18 years, >18 years} 
    GiniIndex: [float] Gini Index calculated for all reviews of a scientist
    NumReviews: [string/categorical] Number of reviews used to calculate the scientist's Gini Index
    EuNaAu: [boolean] Indicates whether the scientist's affiliation is in a Western Region.
	TRUE: The ContinentalRegion is one of Europe, North America, or Oceania.
	FALSE: The ContinentalRegion is outside of these regions.
    GenderAgg: [string/categorical] Aggregated classification of reviewers' gender for the manuscript:
	All women: All reviewers identified as women.
	All men: All reviewers identified as men.
	Mixed: Manuscript reviewed by a mix of men and women.
    EuNaAuAgg: [string/categorical] Aggregated classification of reviewers' geographical affiliation for the manuscript:
	All EuNaAu: All reviewers are from Western Regions (Europe, North America, or Oceania).
	All rest: All reviewers are from other regions.
	Mixed: Manuscript reviewed by a mix of reviewers from Western and non-Western regions


Table:
    data_readability_scores

Description:
    This table contains the essential context and calculated readability metrics for peer reviews, focusing on the reviewer's general attributes, journal characteristics, and various readability scores.

Columns:
    Journal: [string/categorical] Anonymous unique Journal identifier.
    ManuscriptID: [integer] Anonymous unique Manuscript identifier.
    ScientistID: [integer] Anonymous unique Scientist (reviewer) identifier.
    Area: [string/categorical] The scientific Area or field of the manuscript.
    IFQuartile: [string/categorical] JCR quartile of the journal (e.g., Q1, Q2, Q3, Q4, NI).
    PRType: [string/categorical] Peer review type (e.g., unpublished, open).
    IsSigned: [boolean] Indicates if the reviewer has signed the review (0 or 1).
    Gender: [string/categorical] Gender of the scientist (e.g., male, female).
    ContinentalRegion: [string/categorical] Scientist affiliation continental region.
    hf_readability_grade_deberta: [float] Readability grade estimated by the deberta model.
    hf_readability_grade_mdeberta: [float] Readability grade estimated by the mdeberta model.
    flesch_interpretation: [string/categorical] Qualitative interpretation of Flesch readability.
    flesch_ease: [float] Flesch Reading Ease Score.
    flesch_grade: [float] Flesch-Kincaid Grade Level.
    gunning_fog: [float] Gunning Fog Index score.
    smog: [float] SMOG Index score.
    ari: [float] Automated Readability Index (ARI) score.
    coleman_liau: [float] Coleman-Liau Index score.
